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AAGTAAAAGAAAGAGCGAGAAATCATCGAAATGGATTTCATCTCATCTCTTATCGTTGGC 

1 — +-- — + — + +_ + 60 

TTCATTTTCTTTCTCGCTCTTTAGTAGCTTTACCTAAAGTAGAGTAGAGAATAGCAACCG 

a K*KKE R EIIE (§) DFISS L IVG - 
b SKRKSEKSSKWISSHLLSLA - 
C V K E R A R N H RN G F HL I S Y RW L- 

TGTGCTCAGGTGTTGTGTGAATCTATGAATATGGCGGAGAGAAGAGGACATAAGACTGAT 
61 + 12 Q 

ACACGAGTCCACAACACACTTAGATACTTATACCGCCTCTCTTCTCCTGTATTCTGACTA 

a C AQVLCE SMNMAERRGHKTD - 
b V L R C C V N L * I W R R E E D IRLI - 
c CSGVV* IYEYGGEKRT * D * S - 

CTTAGAC^GCCATCACTGATCTTGAAACAGCC^TCGGT 

121 — — — + + ^--_ + --""~+-"------+ iBO 

GAATCTGTTCGGTAGTGACTAGAACTTTGTCGGTAGCCACTGAACTTCCGGTATGCACTA 

... .... „ 4 

a LRQAITDLETAI GDLKAI R D - 
b L D K P S L I L K Q P S V T * R P Y V M - 
c * T S H H * S - * N S H R * LEG H T * * 

GACCTGACTTTACGGATCCAACAAGACGGTCTAGAGGGACGAAGGTGCTCAAATCGTGCC 
181 -------^+----.-----+----- +---- 240 

GTGGACTG71AATGCCTAGGTTGTTCTGCCAGATCTCCCTGCTTCGACGAGTTTAGCACGG 

a DL T LRI Q Q D G L E GR S C S N R A 

b T * L YG S NK TV * RD E A A Q I V P - 

c PDFTDPTRR SRGT KL LKSCQ- 

AGAGAGTGGCTTAGTGCGGTGCAAGTAACGGAGACTAAAACAGCGCTACTTTTAGTGAGG 
241 + „„_™_ + __„„„ + 3 0 o 

TCTCTCACCGAATCACGCCACGTTCATTGGCTCTGATTTTGTCGGGATGAAAATCACTCC 

a REWLS AVQVT ETK TA L LLV R - ■ 
b ESGLVRCK* RRLKQPYF * * G - 
c RVA*'CG ASN6D *NSPTFS EV- 

TTTAGGCGTCGGGAACAGAGGACGCGAATGAGGAGGAGATACCTCAGTTGTTTCGGTTGT 

301 — —+-——+ +- — — + 360 

AAATCCGCAGCCCTTGTCTCCTGCGCTTACTCCTCCTCTATGGAGTCAACAAAGCCAACA 

a FRRREQRTRMRRRYL S CF G C - 
b LGVGNRGRE * GGD TS VVSVV- 
C * AS G T E D AN E E E I P Q L F R L C - 

GCCGACTACAAACTGTGCAAGAAGGTTTCTGCCATATTGAAGAGCATTGGTGAGCTGAGA 

361 + + +— +- +— + 420 

CGGCTGATGTTTGACACGTTCTTCCAAAGACGGTATAACTTCTCGTAACCACTCGACTCT 



Fig. 2A 
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a ADYK LCK KV SAI LKSIGELR 

b PTTNCARRFLPY * R A L V S * E - 

c RLQ T VQ E G FCH I EE H W * A E R - 

GAACGCTCTGAAGCTATCAAAACAGATGGCGGGTCAATTCAAGTAACTTGTAGAGAGATA 

421 -™+ + ~— + — ~f — + 480 

CTTGCGAGACTTCGATAGTTTTGTCTACCGCCCAGTTAAGTTCATTGAACATCTCTCTAT 

a ERSEAIKTDGGS IQVTCREI 

b N A L K L S K Q M A G Q F K * L V E R Y - 

C T L * SYQNRWRVN S SN L * RDT- 

CCCATCAAGTCCGTTGTCGGAAATACCACGATGATGGAACAGGTTTTGGAATTTCTCAGT 

481 — --+-- ——---+— — — - + r -~ +— — + 540 

GGGTAGTTCAGGCAACAGCCTTTATGGTGCTACTACCTTGTCCAAAACCTTAAAGAGTCA 

a P I K S V V G N T T M M E QVL E F L S - 
b P S S P L S E I P R * WNR FWN FSV- 
c HQVRCR KYHD D G TQFG I S Q * - 

GAAGAAGAAGAAAGAGGAATCATTGGTGTTTATGGACCTGGTGGGGTTGGGAAGACAACG 
541 _„ + _^-„„„ + 6 oo 

CTTCTTCTTCTTTCTCCTTAGTAAC(^CAAATACCTGGACCACCCCAACCCTTCTGTTGC 

a EEE ERG I I GVYGPGGVGKTT 

b KKKKE E S LVFMD LVG L GRQ R- 

C R R R K R N H W C L W T W W G W E D N V- 

< 

TTAATGCAGAGCATTAACAACGAGCTGATCACAAAAGGACATCAGTATGATGTACTGATT 

601 — +- — + + +-- — - +— + 660 

AATTACGTCTCGTAATTGTTGCTCGACTAGTGTTTTCCTGTAGTCATACTACATGACTAA 

a LMQSINN EL ITKGHQYDVLI - 
b *CRALTTS* SQ KDISMMY*F- 
C N A E H * QRADHKRT SV * C TDL- 

TGGGTTCAAATGTCCAGAGAATTCGGCGAGTGTAGAATTCAGCAAGCCGTTGGAGCACGG 

661 — -+ +~- +— — +— — + 720 

ACCCAAGTTTACAGGTCTCTTAAGCCGCTCACATGTTAAGTCGTTCGGCAACCTCGTGCC 

a WVQMS R E FGE C T I QQA V G A R - 
b GF KCP E NS ASVQFSKPL EHG- 
c GSNVQRI RRVYNSASRWSTV- 



TTGGGTTTATCTTGGGACGAGAAGGAGACCGGCGAAAACAGAGCTTTGAAGATATACAGA 

721 — _+-——+ + — + -+ + 780 

AACCCAAATAGAACCCTGCTCTTCCTCTGGCCGCTTTTGTCTCGAAACTTCTATATGTCT 

a L GLS WDEKE TGENRALKIYR - 
b WV YL GTRRRPAKTEL*RYTE - 
c GFILGREGDRRKQSFEDI QS- 

GCTTTGAGACAGAAACGTTTCTTGTTGTTGCTAGATGATGTCTGGGAAGAGATAGACTTG 

781 + +«- + 840 

CGAAAGTCTGTCTTTGCAAAGAACAACAACGATCTACTACAGACCCTTCTCTATCTGAAC 

Fig. 2B 
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a AL RQKRFLL LLDD VW EE IDL 
b L*DRNVSCC C* MMSGKR * TW - 
c FE TE T F L V V A R * C L G RDR L G- 

GAGAAAACTGGAGTTCCTCGACCTGACAGGGAAAACAAATGCAAGGTGATGTTCACGACA 
841 +„. 900 

CTCTTTTGACCTCAAGGAGCTGGACTGTCCCTTTTGTTTACGTTCCACTACAAGTGCTGT 

a E K TGVPRPDRE NKCKVMFTT - 
b RKLE F L D, L T GKT NAR * C S R H - 
C ENWSSST*QGK Q M Q G D V H D T - 

CGGTCTATAGCATTATGCAACAATATGGGTGCGGAATACAAGTTGAGAGTGGAGTTTCTG 
g01 --„ H .-_„„„ H ._--„„__ + „..„:.-„ + -+ 960 

GCCAGATATCGTAATACGTTGTTATACCCACGCCTTATGTTCAACTCTCACCTCAAAGAC 

a RS IALCNNM G AEYKL RVEFL 
b G L * H Y A T I WVRN T S * E W S F W - 
c VYS IMQ QYGCG I QV E S GVS G - 

GAGAAGAAACACGCGTGGGAGCTGTTCTGTAGTAAGGTATGGAGAAAAGATCTTTTAGAG 
961 —+———+« -„ + „ _ + 10 20 

CTCTTCTTTGTGCGCACCCTCGACAAGACATCATTCCATACCTCTTTTCTAGAAAATCTC 

a EKKHAWEL FC S K V W R K D L L E 
b RRNT R GS CSV VRY GEKIF* S - 
C E ETRVGA VL* *GMEKRSF R V- 

TCATCATCAATTCGGCGGCTCGCGGAGATTATAGTGAGTAAATGTGGAGGATTGCCACTA 

1021 — ^— +— -+———+- ™_ + 10 80 

AGTAGTAGTTAAGCGGCCGAGCGCCTCTAATATCACTCATTTACACCTCCTAACGGTGAT 

a SSSIRRLAEIIVSKCGGLPL 
b HHQ FAGSRRL* *VNVEDCH * 
C I I N S P A R G D Y S E * M W R I A T S - 

GCGTTGATCACTTTAGGAGGAGCCATGGCTCATAGAGAGACAGAAGAAGAGTGGATCCAT 

1081 — — +— — + + •+-- — •+ 1140 

CGCAACTAGTGAAATCCTCCTGGGTACCGAGTATCTCTCTGTCTTCTTCTCACCTAGGTA 

a ALITLGGAMAHRETEE EWIH 

b R* SL*EEPWLIERQKKSGSM- 

C V D H F R R S H G S * RDRRRVD P C- 

GCTAGTGAAGTTCTGACTAGATTTCCAGCAGAGATGAAGGGTATGAACTATGTATTTGGC 

1141 — + — — + — +— - — + — + 1200 

CGATCACTTCAAGACTGATCTAAAGGTCGTCTCTACTTCCCATACTTGATACATAAACGG 

a A S E V L T R F P A E M K G M N Y V FA - 
b LVKF*LDFQQR*RV*TMYL P- 
c **SSD*I SS RDEGYELC ICP- 

CTTTTGAAATTCAGCTACGACAACCTCGAGAGTGATCTGCTTCGGTCTTGTTTCTTGTAC 

1201 — — + + + — + + 1260 

GAAAACTTTAAGTCGATGCTGTTGGAGCTCTCACTAGACGAAGCCAGAACAAAGAACATG 

Fig. 2C 
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a LLKFSYDNLES DLLRSCF LY - 
b F * N S A T T T SRV I C FG L VS C T - 
c FE I QLRQP RE * SASVLFLVL- 

TGCGCTTTATTCCCAGAAGAACATTCTATAGAGATCGAGCAGCTTGTTGAGTACTGGGTC 
1261 — +- +- + ----+ + 1320 

ACGCGAAATAAGGGTCTTCTTGTAAGATATCTCTAGCTCGTCGAACAACTCATGACCCAG 

a C ALFPEEH SIE IEQ LVEYWV 

b ALY SQ KN I L* RS S S L L STG S - 

c R F I P RR T FY R DRA A C * VL G R - 

GGCGAAGGGTTTCTCACCAGCTCCCATGGCGTTAACACCATTTACAAGGGATATTTTCTC 

1321 — + +— + + 1380 

CCGCTTCCCAAAGAGTGGTCGAGGGTACCGCAATTGTGGTAAATGTTCCCTATAAAAGAG 

a GEGFLTS SHGV NT IYKGYFL 

b AKG FSPA PMALTPFT R D I F S - 

c RRVS H Q L PWR * H H LQ G I F S H- 

ATTGGGGATCTGAAAGCGGCATGTTTGTTGGAAACCGGAGATGAGAAAACACAGGTGAAG 

1381 — — — - +--- — r— +— + — + 1440 

TAACCCCTAGACTTTCGCCGTACAAACAACCTTTGGGCTCTACTCTTTTGTGTCCACTTC 

a I G D LKAAC L L ET GD E K T Q V K 

b L G I * KRHV CWKPEMR KHR* R - 

c WG SE S G MFV GNRR * E N T GE D- 

ATGCATAATGTGGTCAGAAGCTTTGCATTGTGGATGGCATCTGAACAGGGGACTTATAAG 
1441 + + + -+ +- -+ 1500 

TACGTATTACACCAGTCTTCGAAACGTAACACCTACCGTAGACTTGTCCCCTGAATATTC 

a MHNVVRS FAL W N A S E Q G T Y K 

b C I M W S EALHCGWHLNRGLI R - 

c A * CGQKL CIV DGI *TGD L*G- 

GAGCTGATCCTAGTTGAGCCTAGCATGGGACATACTGAAGCTCCTAAAGCAGAAAACTGG 

1501 — + +- -+ — +- — -~+ 1560 

CTCGACTAGGATCAACTCGGATCGTACCCTGTATGACTTCGAGGATTTCGTCTTTTGACC 

a EL I LVE PSMGHTEA PK A E N W 

b S * S * L S LAWD I LK LLKQKT G - 

C A D P S *- A * H G T Y * S S * S R K L A - 

CGACAAGCGTTGGTGATCTCATTGTTAGATAACAGAATCCAGACCTTGCCTGAAAAACTC 

1561 — + ----+--"-----+------—-+ +~— + 1620 

GCTGTTCGCAACCACTAGAGTAACAATCTATTGTCTTAGGTCTGGAACGGACTTTTTGAG 

* 

a RQ ALVI SLLDN R IQ TLPEKL - 
b DKRW*SHC *ITE SRPCLKNS- 
C T SVGDLIVR*Q NPDLA*KTH- 
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ATATGCCCGAAACTGACAACACTGATGCTCCAAGAGAACAGCTCTTTGAAGAAGATTCCA 

1621 +— — — - +-™ + 1680 

TATACGGGCTTTGACTGTTGTGACTACGAGGTTGTCTTGTCGAGAAACTTCTTCTAAGGT 

a l C PKLTTLMLQQNSSL KKIP 

b Y A RN * Q H * CSN RTAL * R RFQ - 

c MPETDNTDAPTEQLF.EE DSN- 

ACAGGGTTTTTCATGCATATGCCTGTTCTCAGAGTCTTGGAGTTGTCGTTCACAAGTATC 

1681 +• :+-- — + 1740 

TGTCCCAATIAAGTACGTATACGGACAAGAGTCTC^GAACCTGAACAGCAAGTGT^ 

a TGFFMHMP V LRVL DLSFT SI 

b QGFSC ICLFSESWTCR SQVS - 

C RVFHAYACSQSLGLVVHKYH- 

ACTGAGATTCCGTTGTCTATCAAGTATTTGGTGGAGTTGTATCATCTGTCTATGTCAGGA 

1741 + + + 1800 

TGACTCTAAGGCAACAGATAGTTCATAAACCACCTCAACATAGTAGACAGATACAGTCCT 

a TE IPLSIKYLVELYH LSM SG 

b LRFRCLSSI WWSCII C LCQE- 

c * D SVVYQ VFGGV VS S VYVRN- 

ACAAAGATAAGTGTATTGCCACAGGAGCTTGGGAATCTTAGAAAACTGAAGCATCTGGAC 

1801 ——+———+ ---,-+—-- + +- + i860 

TGTTTCTATTCACATAACGGTGTCCTCGAACCCTTAGAATCTTTTGACTTCGTAGACCTG 

a TKI SVL PQELGNL RKLKHLD - 
b QR* VYCHRSLGILEN*S I W T - 
c KDK CIAT GAWE S * K TEASGP- 

CTACAAAGAACTCAGTTTCTTCAGACGATCCCACGAGATGCCATATGTTGGCTGAGCAAG 
1861 + — -+ + + + „ + 1920 

GATGTTTCTTGAGTCAAAGAAGTCTGCTAGGGTGCTCTACGGTATACAACCGACTCGTTC 

a LQR T QFLQ TIP R DAI CWLSK - 
b Y K E L S F F R R S H E M P Y V G * A S - 
c TKNSVS S D-D PT RG HMLAEQA- 

CTCGAGGTTCTGAACTTGTACTACAGTTACGCCGGTTGGGAACTGCAGAGCTTTGGAGAA 

1921 + + +~ + +- — + 1980 

GAGCTCCAAGACTTGAACATGATGTCAATGCGGCCAACCCTTGACGTCTCGAAACCTCTT 

a LEVL NL Y Y S YAGWE L Q S F G E 

b S R F * T C T TV T P V G N C R A L E K - 

c R GSE LVL QLRRLGTAELWRR- 

GATGAAGCAGAAGAACTCGGATTCGCTGACTTGGAATACTTGGAAAACCTAACCACACTC 

1981 — +— -+— •+ + -+ + 2040 

CTACTTCGTCTTCTTGAGCCTAAGCGACTGAACCTTATGAACCTTTTGGATTGGTGTGAG 



Fig. 2E 
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a DEAEEL GFADLEYLEN LTTL 

b MKQK NSD SLTWNTWKT * PH S - 

C * S RRT RI R * L 6.1 LGK P NHT R- 

GGTATCAGTGTTCTCTCATTGGAGACCCTAAAAACTCTCTTCGAGTTCGGTGCTTTGCAT 

2041 — + + + + +— + 2100 

CCATAGTGACAAGAGAGTAACCTCTGGGATTTTTGAGAGAAGCTCAAGCCACGAAACGTA 

a GITVLSL ETLKTLFEFGALH 

b V S L F S H W R P * K L S S S S V L C I - 

c YH C S L I GDPKN S LRVRC FA * - 

AAACATATACAGCATCTCCACGTTGAAGAGTGCAATGAACTCCTCTACTTCAATCTCCCA 

2101 — + — „ + + 2160 

TTTGTATATGTCGTAGAGGTGCAACTTCTGACGTTACTTGAGGAGATGAAGTTAGAGGGT 

a KHIQHLH VEE CNE LLYFNLP 

b N I Y S ISTLKS.AMNSSTSISH-r 

c T Y T A S P R * R V Q * T P L L Q S P I - 

TCACTCACTAACCATGGCAGGAACCTGAGAAGACTTAGCATTAAAAGTTGCCATGACTTG 
2161 — — „+ _„„+- . + „-^„^_ + „ „ + 2220 

AGTGAGTGATTGGTACCGTCCTTGGACTCTTCTGAATCGTAATTTTCAACGGTACTGAAC 

a SLTNHGRNLRRLSI KSCHDL 

b H S L T MA G T * E D LA L K VA MT W - 

c T H * P W Q E P E K T * H * K L P * L G - 

GAGTACCTGGTCACACCCGCAGATTTTGAAAATGATTGGCTTCCGAGTCTAGAGGTTCTG 

2221 — +- w -h— + 2280 

CTCATGGACCAGTGTGGGCGTCTAAAACTTT^ 

a EY LVTPA DFENDWLPSLEVL 

b STWSHPQI LKMIGFRV*RF* - 

c VP GHTR RF* K * LASE S R G S D - 

ACGTTACACAGCCTTCACAACTTAACCAGAGTGTGGGGAAATTCTGTAAGCCAAGATTGT 

2281 — +— ~ 2340 

TGCAATGTGTCGGAAGTGTTGAATTGGTCTCACACCCCTTTAAGACATTCGGTTCTAACA 

a TLHSLHNLT RVWGNSVSQ DC - 
b RYTAFT T* PECGEIL*AK IV- 
c VTQP SQLN QSVGK FCKPRLS-^ 

CTGCGGAATATCCGTTGCATAAACATTTCACACTGC^ 

2341 ---—+■ — +„_„„„+ 2400 

GACGCCTTATAGGCAACGTATTTGTAAAGTGTGACGTTGTTCGACTTCTTACAGAGTACC 

a LRN IRCI NI SHCNKL KNVSW - 
b C GISVA*TFHTATS*RMSHG- 
c . A E Y P L H K H F T L Q Q A E E C L M 6- 

GTTCAGAAACTCCCAAAGCTAGAGGTGATTGAACTGTTCGACTGCAGAGAGATAGAGGAA 

2401 -— + + + --+-- +— + 2460 

CAAGTCTTTGAGGGTTTCGATCTCCACTAACTTGACAAGCTGACGTCTCTCTATCTCCTT 

Fig. 2F 
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a VQ KLP KLEVIELFDCR EIEE 

b FRNSQS * R * LNC S T A E R * R N - 

c S E T PKARGD * T VRLQR DRG I - 

TTGATAAGCGAACACGAGAGTCCATCCGTCGAAGATCCAACATTGTTCCCAAGCCTG7VAG 

2461 — -+ + + + — — + 2520 

AACTATTCGCTTGTGCTCTCAGGTAGGCAGCTTCTAGGTTGTAACAAGGGTTGGGACTTC 

a LI SEHESPSVEDPTLF PSL K 

b * * A N T R V H P SK I QHC S Q A * R - 

c D K R T RE S I R R R S N I V P K P E D - 

ACCTTGAGAACTAGGGATCTGCCAGAACTAAACAGCATCCTCCCATCTCGATTTTCATTC 

2521 « — +-,-- + 2580 

TGGAACTCTTGATCCCTAGACGGTCTTGATTTGTCGTAGGAGGGTAGAGCTAAAAGTAAG 

a T LR T R D L P E L N S I L P S R F S F 
b P * E L G I CQN * T AS S H L D FH S 
c L E N * G S A R T K Q H P P I S I F I P - 

CAAAAAGTTGAAACATTAGTCATCACAAATTGCCCCAGAGTTAAGT^AACTGCCGTTT 

2581 — — — - + + — +— — + 2640 

GTTTTTCAACTTTGTAATCAGTAGTGTTTAAGGGGGTCTO^TTCTTTGACGGCAAAGTC 

a Q K V E T L V I T N C P R V K K L P F Q - 
b K K L K H * S S Q I A P E L R N C R F R - 
c K S * N I S H H K L P Q S * E T A V S 6 - 

GAGAGGAGGACCCAGATGAACTTGCCAACAGTTTATTGTGAGGAGAAATGGTGGAAAG^ 

2641 —--+-- -——+-• — — + — — +~ + 2700 

CTCTCCTCCTGGGTCTACTTGAACGGTTGTCAAATAACACTCCTCTTTACCACCTTTCGT 

a ERRT Q MNL PTVYC EE KWW KA 

b RGGPR* TCQQ F I VR RNGGK H - 

c E E D P D E L AN S L L * G E NVE S T - 

CTGGAAAAAGATCAACCAAACGAAGAGCTTTGTTATTTACCGCGCTTTGTTCCAAATTGA 

2701 + — ----+-- + + + 2760 

GACCTTTTTCTAGTTGGTTTGCTTCTCGAAACAATAAATGGCGCGAAACAAGGTTTAACT 

a LEKDQP NEELCYLPRFVPN * 

b WKKI NQTKS FVI YR AL F QI D - 

c GKR S T KR RAL L F T A L C SK L I - 

TATAAGAGCTAAGAGCACTCTGTACMATATGTCCATTCAT^ 

2761 ----- + + + + + + 2820 

ATATTCTCGATTCTCGTGAGACATGTTTATACAGGTAAGTATTCTACGTCCTTCGGTCCT 

a YKS*EH SVQI C PFIRCRKPG - 
b IRAKSTLYKYVHS*DAGSQ E- 
c *E LRALCTNMS I HK M QEARK- 

AGGTTGTTCCAGTGAAGTGATCAACTTTCGACATAGCCACAAAACTAGAGATTATGTAAT 

2821 + + + + + — + 2880 

TCCAACAAGGTCACTTCAGTAGTTGAAAGGTGTATCGGTGTTTTGATCTCTAATACATTA 

Fig. 2G 
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a RLFQ * S HQLS T * PQN * R L 
b G CSS E V I N F P H S K T^ 1 



c 



V V P 



V KS S TF HIATKLE I 



CATAAAAACCAAACTATCCGCGA 

2881 -- +— + 2903 

GTATTTTTGGTTTGATAGGCGCT 

a H K H Q T I R 

b IKTKLSA 

c * K P N Y P R - 

ENZYMES THAT DO CUT : 

NONE 

ENZYMES THAT DO NOT CUT: 
Kpnl 



Fig. 2H 
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-146 . 

ATCGATTGATCTCTGGCTCAGTGCGAGTAGTCCATTTGAGAGCAGTCGTAGCCCCGCGTG - 8 6 



GCGCATCATGGAGCTATTTGGAATTTTCGCAGGGTTATGGATTCGTAGTGGGAACCCATT -26 



CATTGTTTGGAACCACCAACGGACGACTTAACAAGCTCCC CGAGGT GCATGATGAAAATT 3 5 

MetLysIle 

■ — i — m m. .-i — ii ii i- - ~ • • . * 

GCTCCAGTTGCCATAAATCACAGCCCGCT 95 
AlaProValAlalleAsnHisSerProLeuSerArgGluValProSerHisAlaAlaPro 



ACTCAGGCAAAGGAAACCAACCTTCAATCTGAAGCTGGCGAT^ 155 
ThrGlnAlaLysGlnThrAsnLeuGlnSerGluAlaGlyAspLeuAspAlaArgLysSer 



AGCGCTTCAAGCGCGGAAACCCGCGCATT ACTCGCTACT AAGACAGT ACTCGGGAGACAC 215 
SerAlaSerSerProGluThrArgAlaLeuLeiaAlaThrLysThrValLeuGlyArgHis 

• « • « • • 
AAGATAGAGGTTCCGGCCTTTGGAGGGTGGTTCAAAAAGAAATCATCTAAGCACGAGACG 275 
LysIleGluValProAlaPheGlyGlyTrpPheLysLysLysSerSerLysHisGluThr 

• • • ■ • * • 
GGCGGTTCAAGTGCC^ACGCAGATAGTTCGAGCGTGGCTTCCGATTCCACCGAAAAACCT 335 
GlyGlySerSerAlaAsnAlaAspSerSerSerValAlaSerAspSerThrGluLysPro 

• < ■ • • • 
TTGTTCCGTCTCACGCACGTTCCTTACGTATCCCAAGGTAATGAGCGAATGGGATGTTGG 395 
LeuPheArgLeuThrHisValProTyrValSerGlnGlyAsnGluArgMetGlyCysT^ 

• • • • • ■ 
TATGCCTGCGCAAGAATGGTTGGCCATTCTGTCGAAGCTGGGCCTCGCCTAGGGCTGCCG 455 
TyrAlaCysAlaArgMetValGlyHisSerValGliiAlaGlyProArgLeuGlyLeuPro 

• • ■ ■ • • . 
GAGCTCTATGAGGGAAGGGAGGCGCCAGCTGGGCTACAAGATTTTTCAGATGTAGAAAGG . 515 
GluLeuTyrGluGlyArgGlioAlaProAlaGlyLeuGlnAspPheSerAspValGluArg 

• • • * * • 
TTTATTCACAATGAAGGATTAACTCGGGTAGACCTTCCAGACAATGAGAGATTTACACAC 575 
PhelleHisAsnGluGlyLeuThrArgValAspLeuProAspAsnGluArgPheThrHis 



Fig. 3A 
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GAAGAGTTGGGTGCACTGTTGTATAAGCACGGGCCGATTATATTTGGGTGGAAAACTCCG 635 
GluGluLeuGlyAlaLeuLeuTyrLysHisGlyProIlellePheGlyTrpLysThrPro 



AATGACAGCTGGCACATGTCGGTCCTCACTGGTGTCGATAAAGAGACGTCGTCCATTACT 695 
AsnAspSerTrpHisMetSerValLeuThrGlyValAspLysGluThrSerSerlleThr 



TTTCACGATCCCCGACAGGGGCCGGACCTAGCAATGCCGCTCGATTACTTTAATCAGCGA 755 
PheHisAspProArgGlnGlyProAspLeuAlaMetProLeuAspTyrPheAsnGlnArg 

• ■ • • • • • 
TTGGCATGGCAGGTTCCACACGCAATGGTCTACCGCTAAGTAGCAGGGTATCTTCACGTG 815 
LeuAlaTrpGlnValProHisAlaMetLeuTyrArgEnd 

• * ■ * * m 

GCGGCATCATGACAAGCCCATGATGCCGCCAGCAGCTACCTGAATGCCGTCTGGCTTTTT 875 



GGTCCCTATTGTCGTATCCGGAAGATGACGTCAAAGAATCTCGGCAAGAGCTTTCTTGCT 935 



CGACTCCTCAGCTTCCGGATCGATCAGGTCGCTTGCCAGAGCGCGCTTGTCCATGAGCAT 995 



CTGCCACAGCTGCTGGTCGATGGTGTCCTCAGCT AAAGGGATTTTGACGACAACCATGCG 1055 



CAACTGCCCGTTGCGATACGCTCGATCCTGAAGCCCCGGTGTCCATGGCAGCCCCAAGAA 1115 



AAAGACATAGTTCGCCGCTGTGAGGTTGTAGCCTGTGCCGGCGGCCGACCTGGTCCCGAT 1175 



AAACACCCTGCAGTCCGGATCCTGCTGGAAAGCATCAATCGCCTTCTGCCGCTTCTTGGG 1235 



CGAGTCACTGCCCACCAACGTCACGCACCCGACGCCAAGCTTGAGGCAGTGCTCCCGCAA 12 95 



• -. ,. _ 



CGTGGCCACGGATTCCTGATACTCGCAGAAGAGGATCACCTTGTCGTCGAC 1346 



Fig. 3B 



(TitleTRPSGENE FAMILY, PRIMERS, PROBES, AND 
iDETECTION METHODS 
jApplicant(s): Ausubeletal. 

Filing Date: July 2, 2003 Serial No::Unknown 

Page 12 of 30 Customer No.: 21 559 



o 

U") 

CO 



cm 



CO 

csi 



CO 



csi 



" • 

CO' 
CO 



OI 

CO 



CO 
csi 

co 



CO 



O 

LU 

o 



lo 
3* 




CO 

3fc 



to 

-op 

CL °- 
1 0) 



CO 

< 



CO 



< 

Q 
u 



O 

I 



8 



< 

o 
t3 



i 

Q 

CL 

to 

LU 



I 

Cl 
10 

CO 



> 



O Q 
^ Q. 

Q to 

CL lu 

- DC 
CO 



CN 

Q 

CL 



i 

Q 

CL 
LO 

LU 

or 



P csi 



to 



o *q 

. to 
to 

CO 



LU 
I 

3 

to 



J 



(Title : RPS GENE FAMILY, PRIMERS, PROBES, AND 
DETECTION METHODS 
Applicants): Ausubel et al. 

Filing Date: July 2, 2003 Serial No. :Unknown j i 

Page 13 of 30 Customed No.: 21 559 ! 



1 50 
L6pro MSYLREVATA VALLLPFILL NKFWRPNSKD SIVNDDDDST SEVDAISDST 



Nprot 

PrfP 

rps2 



M 



51 6 100 

L6prO NPSGSFPSVE YEJVFLSFRGjP DTREQFTDFL YQSLRRYKIM TFRDDDELLK 

Nprot assssssrws ye IvflsfrgI e DTRKTFTSHL YEVLNDKGIK TFQDDKRLEY 

PrfP ...... i LRSKLDLIID LKHQIESVKE 

rps2 MDF2SSLIVG CAQVLCESMN MAERRGHKTD LRQA1TDLET 

101 150 
L6pro GKEIGPNLLR AIDQSKIYVP IISSGYADSK WCLMELAEIV RRQEEDPRRI 
Nprot GATIPGELCK AIEESQFAIV VFSENYATSR VfCLNELVKIM ECK . TRFKQT 

PrfP GLLCLRSFID HFSESYDEHD EA CGLIARVSVM AYKAE 

rps2 AIGDLKAIRD DLTLRIQQDG LEGRSCSNRA REWLSAVQVT ETKTA 

15 1 7 200 
L6pro ILIPIFYMVDP SpVRHQTGCY KKAFRKHANK F. .DGQTIQN WKDALKKVGD 
Nprot Vl |PIFYDVDP Sj HVRNQKESF AKAFEEHETK YKDDVEGIQR WRIALNEAAN 

PrfP ..... YVIDS CLAYSHPLWY KVLW IS . EVLENIKLV 

rps2 . . . . . LLLVR FRRREQRTRM RRRY . . LSCFGCAD YKLCKKVSAI 



201 



8 250 



L6pro LKGWHIGRND KQGAIADKVS ADIWSHISKE NLILE...TD ELVGIDDHIT 
Nprot LKGSCDNRDK TDADCIRQIV DQISSKLCKI SLSY....LQ NljVGlDTgLE 
PrfP NKWGETCER RNIEVTVHEV AKTTTYVAPS FSAYTQRANE EMEGFQDTID 
rps2 LKSIGELRER SEAIKTDGGS IQVTCREIPI KSWG. NTTMM 



251 



1 



L6pro AVLEKLSLDS ENVTMVGLYlG MGGIGKTTTA 
Nprot KIESLLEIGI NGVRIMG1WI3 MGGVGKTTIA 
PrfP ELKDKLLGGS PELDVISIVS MPGLGKTTLA 



rps2 EQVLEFLSEE EERGIIGVY 



PGGVGKTTLM 



^-loop 300 

KAVYNKI SSC.FDCC 

RAIFDTLLGR MDSSYQFDGA 
KKIYNDPEVT . . SRFDVHAQ 
QSINNELITK . .G, . . .HQY 



301 350 
L6pro CFIDNIRETQ EKDGVWLQK KLVSEILRID . . SGSVGFNN DSGGRKTIKE 

Nprot CFLKDIKE. . ... NKRGMHSLQN ALLSELLR EKANYNN EEDGKHQMAS 

PrfP CWTQLYSWR EL . LLTILND VLEP . . . S DRNEKED GE . IADELRR 

rps2 DVLIWVQMSR EF.GECTIQQ A VGA. . . RLG . . LSWDEKET GENRALKIYR 



351 



2 



3 



400 



L6pro RVSRFKILW LDDVDEKFKF EDMLGSPKDF ISQ.SRFIIT SRSMRVLGTL 
Nprot RLRSKKVLIV LDDIDNKDHY LEYLAGDLDW FGNGSRIIIT TRDKHLI . . . 
PrfP FLLTKRFLIL 1DDVWDYKVW DNLCMCFSD. VSNRSRIILT TRLNDVAEYV 



rps2 ALRQKRFLLL LDDVWEEIDL EKTGVPRPD. RENKCKVMF|T TRSIALCNNM 



Fig. 5A-1 
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401 450 

L6pro NEN.QCKLYE VGSMSKPRSL ELFSKHAFKK NT....PPSY YETLANDWD 

Nprot . EK.NDIIYE VTALPDHESI QLFKQHAFGK EV. . . . PNEN FEKLSLEWN 

PrfP .KC.ESDPHH LRLFRDDESW TLLQKEVFQG E SCPPE LEDVGFEISK 

rps2 .GA.EYK.LR VEFLEKKHAW ELFCSKVWRK DLLESSSIRR LAEI...IVS 

451 4 



2 500 
L6pro TTAGLPLTLK VlIGSLLFKQE IAV. .WEDTL EQL. . . .RRT LNLDEVYDRL 
Nprot YAKGLPLALK VWGSLLHNLR LTE. .WKSAI EHM....KNN . SYSGIIDNV 
PrfP SCRGLPLSW LVAGVLKQKK KTLDSWKWE QSLS..SQRI GSLEESISII 
rps2 KCGGLPLALI TLGGAMAH.R ETEEEWIHAS EVLTRFPAEM KGMNYVFALL 



501 5 9 550 

L6pro iKISYDAliNPE . AKEltFLDIA CFIFIGQ..NK EEPYYMWTDC NFYPASNIIF 



Nprot 

PrfP 

rps2 



KISYDGLEPK .QQEM jFLDIA CFj LRGE. .EK DYILQILESC HIGAEYGLRI 
GFSYKNL.PH YLKPCFLYFG GFLQGKDIHD SKMTKLWVAE EFVQANN. . . 
KFSYDNl( eSD LLRSCFLYCA LFPEEHSIE1 EQLVEYWVGE GFLTSSHGVN 



551 



10 



600 



L6pro LIQRCMIQVG . .DD DEFKMHDQLR DMGREIVRRE DVLPVJKRSRI 

Nprot L1DKSLVFIS EY NQV aMHDLIQ DMGK YIVNFQ KD.PGERSRL 

PrfP .EK GQEDTRTRF. . LGRSYW. 

rps2 TIYKGYFL1G DLKAACLLET GDEKTQVKMH NWRSFALWM ASEQGTYKEL 

601 650 
L6pro WSAEEGIDLL LNKKGSSKVK AISI.PWGVK YEFK.SECFL NLSELRYLHA 
Nprot WLAKEVEEVM SNNTGTMAME AIWVSSYSST LRFS.NQAVK NMKRLRVFNM 

PrfP 

rps2 ILVEPSMGHT EAPKAENWRQ ALVISLLDNR IQTL.PEKL1 CPKLTTLMLQ 

651 700 
L6pro REAMLTGDFN NLLPNLKWLE LPFYKHGEDD PPLTNYTMKN LII.VILEHS 
Nprot GRSSTHYAID YLPNNLRCFV CTNYPW. . . E SFPSTFELKM LVH.LQLRH. 

PrfP 

rps2 QNSSLKKIPT GFFMHMPVLR VLDLSF.... TSITEIPLSI KYL.VELYHL 



701 750 

L6pro HITADDWGGW RHMMKMAERL KWRLASNYS LYGRRVR 

Nprot ... NSL RHLWTETKHL PSL. ....... . . RRID . . . ..... 

PrfP 

rps2 SMSGTKISVL PQELGNLRKL KHLDLQRTQF LQTIPRDAIC WLSKLEVLNL 

751 800 
L6pro .LSD.CWRFP KSIEVLSMTA IEMDEVDIGE LKKLKTLVLK FCPIQKISGG 
Nprot .LSW.SKRLT RTPDFTGMPN LEY . . VNLYQ CSNLEEVHHS LGCCSKVIGL 

PrfP 

rps2 YYSY.AGWEL QSFGEDEAEE LGFADLEYLE NLTTLGITVL SLETLKTLFE 



Fig.5A-2 
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801 850 
L6prO TFGMLKGLRE L.CLEFNWGT NLREWADIG QLSSLKVLKT TGAKEVEINE 

Nprot YLNDCKSLKR F ........ . . PCVNVESLE 

PrfP 

rps2 FGALHKHIQH L.HVEECNEL LYFNLPSLTN HGRNLRRLSI KSCHDLEYLV 

851 900 

L6pro FPLGLK ELSTSSR IPNLSQLLDL EVLKVYDCKD GFDMPPASPS 

Nprot Y.LGLR. ... ... SCDSLEK LPEIYGRMKP EI . . . . ... . . QIHMQGSGIR 

PrfP 

rps2 TPADFENDWL PSLEVLTLHS LHNLTRVWGN SVSQDCLRNI RCINISHCNK 

901 950 



L6pro EDESSVWWKV SKLKSLQLEK TRINVNWDD ASSGGHLPRY LLPTSLTYLK 
Nprot ELPSSIFQYK THVTKLLL . . .WNMKNLVAL PSSICRL. KSLVSLS 

PrfP 

rps2 LKNVSWVQKL PKLEVIELFD CREIEELISE HESPSVEDPT LFP.SLKTLR 

951 1000 
L6pro IYQCTEPTWL P . GIENLENL TSLEVNDIFQ TLGGDLDGLQ GLRSLEILRI 

Nprot VSGCSKLESL PEEIGDLDNL RVFDASDTL ILRP 

PrfP 

rps2 TRDLPELNSI LPSRFSFQKV ETLVITNCPR VKKLPFQERR TQMNLPTVYC 

1001 1050 
L6pro RKVNGLARIK GLKDLLCSST CKLRKFYITE CPDLIELLPC ELGGQTWVP 
Nprot P SSI IRLNKLIILM FRGFKDGVHF EFPPVAEGLH 

PrfP 

rps2 EEKWWKALEK DQPNEELCYL PRFVPN 

1051 1100 
L6prO SMAELTIRDC PRLEVGPMIR SLPKFPMLKK LDLAVANITK EEDLDAIGSL 
Nprot SLEYLNL.SY CNLIDGGLPE EIGSLSSLKK LDLSRNNF. . EHLPSSIAQL 

PrfP 

rps2 

1101 1150 
L6pro EELVSLELEL DDTSSGIERI VSSSKLQKLT TLWKVPSLR EIEGLEELKS 
Nprot GALQSLDLK . .......... . DCQRLTQLP ELPPELNELH .VDCHMALKF 

PrfP . . .. ........ . . 

rps2 

1151 1200 
L6pro LQDLYLEGCT SLGRLPLEKL KE ...... LD IGGCPDLTEL VQTWAVPSL 

Nprot IHDL.VTKRK KLHRVKLDDA HNDTMYNLFA YTMFQNISSM RHDISASDSL 

PrfP 

rps2 

Fig. 5A-3 
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1201 1250 
L6prO RGLTIRDCPR LEVGPMIQSL PKFPMLNELT LSMVNITKED ELEVLGSLEE 
Nprot .SLTV FTGQPYPEKI PSWFHHQGWD .SSVSVNLPE NWYIPDKFLG 

PrfP • 

rps2 .......... 

1251 1300 



L6pro LD.SLELTLD DTCSSIERIS FLSKLQKLTT L1VEVPSLRE IEGLAELKSL 
Nprot FAVCYSRSLI DTTAHLIPVC . DDKMSRMTQ KLALSECDTE SSNYSEWD.I 

PrfP 

rps2 

1301 1350 

L6pro RILYL. .... . EGCTSLERL WPDQQQLGSL KNLNVLDIQG 

Nprot HFFFVPFAGL WDTSKANGKT PNDYGIIRLS FSGEEKMYGL RLLYKEGPEV 

PrfP ..... • 

rps2 

1351 1387 
L6pro CKSLSVDHLS ALKTTLPPRA RITWPDQPYR ....... 

Nprot NALLQMRENS NEPTEHSTGI RRTQYNNRTS FYELING 

PrfP . 

XpS2 • • • • • •-• *-»rr 9 -»- m • • • • » • • • • • • • • • • • 



Fig. 5A-4 
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N 2 

L6 51 

N 52 

L6 101 

N 101 

L6 151 

N 151 

L6 199 

N 200 

L6 249 

N 250 

L6 293 

N 295 

L6 343 

N 341 
L6 392 



AS S S S S SRWSYINFLSFRGEDTRKTFTSHLYEVLNDKGI KTFQDDKRLEY 5 1 

..'1*1 .. II- lh I- • I'll- I 

NPSGSFPSVEYl jVFIiSFRG| PDTREQFTDFLYQSLRRYKIMTFRDDDELLK 100 



GATIPGELCRAIEESQFAIWFSENYATSRWCLNELVKIMECK.TRFKQT 100 
GKEIGPNLLRAIDQSKIYVPIISSGYADSKWCLMELAEIVRRQEEDPRRI 150 

7 • • • • • 



VI P I FYDVDPS HVRNQKESF AKAFEEHETKYKDDVEGI QRWRI ALNE AAN 150 

Hi II I lll|.|l«|.Mi III .|..h |.--.||.|: | | — * « 

I i jPIFYMVDPSl DVRHQTGCYKKAFRKHANKF . . DGQTI QNWKDALKKVGD 198 

• • • • 8 • 



LKG SCDNRDKTDADC I RQ I VDQ I S SKLCKI SLS Y . LQNI VGIDTH LEKIE 199 
III • • • • • • I • • I I •'• I I I I • 

111* ••••••••• • • • • j • | • • • j • | • •••II | • | • « • 

LKGWHI6KNDKQGAIADKVSADIWSHISKENLILETDELVGIDDHITAVL 248 



m w w ■ ■ 

SLLEIGINGVRIMGIVASMGGVGKTTIARAIFDTLLGRMDSSYQFDGACFL 249 

. |.:: :.| : : | : : 1 1 1 1 : | \ \ \ . | : | : : : . s ||: ||..||:. 

EKLS LDS ENVTMVGLYjGMGGI GKTTT AKAVYNKI . .... SSC . FDCCCFI 292 

• • • • • 

KDIKE . . NKRGMHSLQNALLSELLR . . . EKANYNNEEDGKHQMASRLRSK 294 

.:|:| :| |: ||. h||:|| : . . . : | | : . : | : . : . | : . . 
DNIRETQEKDGVWLQKKLVSEILRIDSGSVGFNNDSGGRKTIKERVSRF 342 

• • • • • 

KVLIVLDDIDNKDHYLEYLAGDLDWFGNGSRIIITTRDKHLI . . . .EKND 340 

I « I * 1 1 1 1 * I * I - s ! h«. I — - l|i|H.|-.«» I-- 

KILWLDDVDEKFKFEDMLGSPKDFISQ . S RF 1 1 T SRSMRVLGTLNENQC 391 

• • • • ■ • ■ 

IIYEVTALPDHESIQLFKQHAFGKEVPNENFEKLSLEWNYAKGLPLALK 390 

:|||..|... |::||..||| h-| . « I • I - i||i • I I i! - I I 
KLYEVGSMSKPRSLELFSKHAFKKNTPPSYYETLANDWDTTAGLPLTLK 441 



N 391 VWGSLLHNLRLTEWKSAIEHMKNN. SYSGIIDNVKISYDGLEPKQQEMFL 439 



INI . 



|. ..:)::: : : . | : 1 1 1 1 1 1 : | : | . ... | : 



L6 442 VIGSLLFKQEIAVWEDTLEQLRRTLNLDEVYDRLKISYDALNPEAKEIJFL 491 

9 • • • • 



n 



N 440 DIACFLRGEEKDYILQILESCHIGAEYGLRILIDKSLVF1SEYNQVQMHD 489 

ITII h h:|: . ::-|:: : : :||::::: ::: ::.7||| 

L6 492 DI ACFFI GQNKEEP YYMWTDCNF YP ASNI I FLI QRCMI QVGDDDEFK MHD 5 4 1 



N 490 LIQDMGKYIVNFQKD . PGERSRLWLAKEVEEVMSNNTGTMAMEAIWVSSY 538 

:.|||: ||. :. | .|||:| |.|. ::: |..|. :.||.: .: 
L6 542 QLRDMGREIVRREDVLPWKRSRIWSAEEGIDLLLNKKGSSKVKAISI.PW 590 
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N 539 

L6 591 

N 587 

L6 641 

N 622 

L6 691 

N 666 

L6 741 

N 714 

L6 782 



SSTLRFSNQAVKNMKRLRVFNMGRSSTHYAIDYLPNNLRCFVCTNYPW. . 586 
I . I . I I . . ..III.. I 

• • • ]*•••• I • ■ II • « • • • • I II* * * 1* 

GVKYEFKSECFLNLSELRYLHAREAMLTGDFNNLLPNLKWLELPFYKHGE 640 

• « # • • 

. ESFPSTFELKMLVHLQLRH NSLRHLWTETKHLPSL 621 

. . I I . . I I ..II. . | 

• • • • I I •••II* • •••!• • 

DDPP LTNYTMKNLI IVILEHSHI TADDWGGWRHMMKMAERLK WRLASNY 690 



RRIDLSWSKRLTRTPDFTGMPNLEY . . VNLYQCSNLEEVHHSLGCC 665 

1 1 : || : . | : . : . : . : | . . : | | : : : . . | . . : . : . . . 

SLYGRRVRLSDCWRFPKSIEVLSMTAIEMDEVDIGELKKLKTLVLKFCPI 740 

• • • • • 

SKVIGLYLNDCKSLKRFPCVNVESLEYLGLRSCDSLEKLPEIYGRMKP. . 713 
I.I . 1.1. . . j . . I 

I • I • • 1*1* * •!«•«•«•• | •••• •!•••• 

QKISGGTFGMLKGLREL . . CLEFNWGTNLREWADIGQLSSLK 781 

• • • • • 

. . , EIQIHMQGSGIRELP. SSIFQYKTHVTKLLLWNM. . . . KNLV 750 

|**|« • | • * J | • J| •• ••• • | •••• ••• 

VLKTTGAKEVEINEFPLGLKELSTSSRIPNLSQLLDLEVLKVYDCKDGFD 831 



11 



N 751 ALPSSICRLRSLVSLSVSGQSKLESLlPEEIGDLPNLRVFPASDTLILRP. 799 
L6 832 



1-1 : -I |.:.| 
MPPASPSEDESSVWWKV. . . 



I II. lit I • M I III-- I-- 
SKLKSLQLEKTRINVNWDDASSGGHLPRY 878 



N 800 

L6 879 

N 831 

L6 928 

N 876 

L6 978 



P S S 1 1 RLNKL 1 1 LMFRGFKDGVHFEFP P VAE 830 

• ) 1*1 I •••• • ' • 

• •I •[••(• I ••■• ••• ••••• • 

LLPTSLTYLKIYQCTEPTWLPGIENLENLTSLEVNDIFQTLGGDLDGL.Q 927 

12 • • • • 



GLHSLEYLNLSYCNLID . . GGLPEEI . GSLSSLKKLDL . . SRNNFEHLPS 875 



• • I • ♦ 



hi 



:| lh 



GLRSLEILRIRKVNGLAR1KGLKDLLCSSTCKLRKFYITECPDLIELLPC 977 



SIA QLGALQSLDLKDCQRLTQLPELPPELNELHVDCHMALKFIHYL 921 

• •• ••••••|«*«||«|| ••••■ **| •••• || ••• 

ELGGQTWVPSMAELTIRDCPRL . EVGPMIRSLPKFPM LKKLDLA 1021 



N 922 VTKRKKLHRVKLDDAHNDTMYNLFAYTMFQNISSMRHDISASDSLSLTVF 971 

| • •" • | ""•*' •• •• •• • ••••• •• )** * * I # | " *| 

L6 1022 VANITKEEDLDAIGSLEELV. . SLELELDDTS SGI ERI VS SSKLQKL 
N 972 
L6 1070 



TL 1069 



TGQPYPEKIPSWFHHQGWDSSVSVN. L P ENWY I PDKFLGFAVC Y 1014 

I •||«' •!•• I* • •)•••• 

• lit • •!••• | * * * •■• • I • ••••• 

W KVPSLREIEGLEELKSLQDLYLEGCTSLGRLPLEKLKELDIGG 1114 
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N 1015 SRSLIDTTAHLIPVCDDK MSRMTQKLA. . . .LSECDTES 1049 

., :::|.. : |.| . 

L6 1115 CPDLTELVQTVVAVPSLRGLTIRDCPRLEVGPMIQSLPKFPMLNELTLSM 1164 



• • • 



N 1050 SNYSEWDIHFFFVPFAGLWDTSKANGKTPNDYGIIRLSFSGEEKMYGLRL 1099 

|... | .:..:.:| . . . :.|....: |.: :| :|: ,\ : 

L6 1165 VNITKEDELEVLGSLEELDSLELTLDDTCSSIERISF . LSKLQKLTTLIV 1213 

• - « • • 

N 1100 LYKEGPEVNALLQMRENSNEPTEHSTGIRRTQYNNRTSFYELIN 1143 

Im 0 « -I • '• • I • I • • I ■ * • • II 

• • • I « I • I • • I ♦ t • • • I 

L6 1214 EVPSLREIEGLAELKSLRILYLEGCTSLER.LWPDQQQLGSLKN 1256 
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10 I 20 I 30 | 40 I 50 | 60 

aagctttaca gattggatga tctcttaatg catgctgaag tgactgcaaa aaggttagca 60 

atattcagtg gttctcgtta tgaatatttc atgaacggaa gcagcactga gaaaatgagg 120 

cccttgttat ctgattttct gcaagagatt gagtctgtca aggtagagtt cagaaatgtt 180 

tgcttgcaag ttctggatat atcacctttt tccctgacag atggagaagg ccttgttaat 240 

ttcttattaa aaaaccaggc caaggtgccg aatgatgatg ctgtttcttc tgatggaagt 300 

ttagaggatg caagcagcac tgagaaaatg ggacttccat ctgattttct ccgagagatt 360 

gagtctgttg agataaagga ggccagaaaa ttatatgatc aagttttgga tgcaacacat 420 

tgtgagacga gtaagcacga tggaaaaagc tttatcaaca ttatgttaac ccaacaggac 480 

aaggtgctgg actatgatgc tggttcagtg tcttatcttc ttaaccaaat ctcagtagtt 540 

aaagacaaaa tattgcacat tggctcttta cttgtagata ttgtacagta ccggaatatg 600 

catatagaac ttacagatct cgctgaacgt gttcaagata aaaactacat tcgtttcttc 660 

tctgtcaagg gttatattcc tgcttggtat tacacactat atctctctga tgtcaagcaa 720 

ttgcttaagt ttgttgaggc agaggtaaag attatttgtc tgaaagtacc agattcttca 780 

agttatagct tccctaagac aaatggatta ggatatctca attgcttttt aggcaaattg 840 

gaggagcttt tacgttctaa gctcgatttg ataatcgact taaaacatca gattgaatca 900 

gtcaaggagg gcttattgtg cctaagatca ttcattgatc atttttcaga aagctatgtt 960 

gagcatgatg aagcttgtgg tcttatagca agagtttctg taatggcata caaggctgag 1020 

tatgtcattg actcatgctt ggcctattct catccactct ggtacaaagt tctttggatt 1080 

tctgaagttc ttgagaatat taagcttgta aataaagttg ttggggagac atgtgaaaga 1140 

aggaacactg aagttactgt gcatgaagtt gcaaagacta ccactaatgt agcaccatct 1200 

ttttcagctt atactcaaag agcaaacgaa gaaatggagg gttttcagga tacaatagat 1260 

gaattaaagg ataaactact tggaggatca cctgagcttg atgtcatctc aatcgttggc 1320 

atgccaggat tgggcaagac tacactagca aagaagattt acaatgatcc agaagtcacc 1380 

tctcgcttcg atgtccatgc tcaatgtgtt gtgactcaat tatattcatg gagagagttg 1440 

ttgctcacca ttttgaatga tgtgcttgag ccttctgatc gcaatgaaaa agaagatgga 1500 

gaaatagctg atgatctacg ccgatttttg ttgaccaaga gattcttgat tctcattgat 1560 

gatgtgtggg actataaagt gtgggacaat ctatgtatgt gcttcagtga tgtttcaaat 1620 

aggagtagaa ttatcctaac aacccgcttg aatgatgtcg ccgaatatgt caaatgtgaa 1680 

agtgatcccc atcatcttcg tttattcaga gatgacgaga gttggacatt attacagaaa 1740 

gaagtctttc aaggagagag ctgtccacct gaacttgaag atgtgggatt tgaaatatca 1800 

aaaagttgta gagggttgcc tctctcagtt gtgttagtag ctggtgttct gaaacagaaa 1860 

aagaagacac tagattcatg gaaagtagta gaacaaagtc taagttccca gaggattggc 1920 

agcttggaag agagcatatc tataattgga ttcagttaca agaatttacc acactatctt 1980 

aagccttgtt ttctctattt tggaggattt ttgcagggaa aggatattca tgactcaaaa 2040 

atgaccaagt tgtgggtagc tgaagagttt gtacaagcaa acaacgaaaa aggacaagaa 2100 

gatacccgca caaggtttct tggacgatct tattggtagg aatctggtga tggccatgga 2160 

gaagagacct aatgccaagg tgaaaacgtg ccgcattcat gatttgttgc ataaattctg 2220 

catggaaaag gccaaacaag aggatttcct tctccagatc aataggtaaa aaaaactgta 2280 

ttaattttac attacaaaaa aaaagaactg tattaatttt actgtattat gtttatgcca 2340 

actctcattt ccatgtgttc tcttttattc aattcagtgg agaaggtgta tttcctgaac 2400 

gattggaaga ataccgattg ttcgttcatt cttaccaaga tgaaattgat ctgtggcgcc 2460 

catctcgctc taatgtccgc tctttactat tcaatgcaat tgatccagat aacttgttat 2520 

ggccgcgtga tatctccttc atttttgaga gcttcaagct tgttaaagtg ttggatttgg 2580 



Fig. 12A 
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2581 aatcattcaa cattggtggt acttttccca ttgaaacaca atatctaatt cagatgaagt 2640 
2641 actttgcggc ccaaactgat gcaaattcaa ttccttcatc tatagctaag cttgaaaatc 2700 
2701 ttgagacttt tgtcgtaaga ggattgggag gagagatgat attaccttgt tcacttctga 2760 
2761 agatggtgaa attgaggcat atacatgtaa atgatcgggt ttcttttggt ttgcgtgaga 2820 
2821 acatggatgt tttaactggt aactcacaat aacctaattt ggaaaccttt tctactccgc 2880 
2881 gtctctttta tggtaaagac gcagagaaga ttttgaggaa gatgccaaaa ttgagaaaat 2940 
2941 tgagttgcat attttcaggg acatttggtt attcaaggaa attgaagggt aggtgtgttc 3000 
3001 gttttcccag attagatttt ctaagtcacc ttgagtccct caagctggtt tcgaacagct 3060 
3061 atccagccaa acttcctcac aagttcaatt tcccctcgca actaagggaa ctgactttat 3120 
3121 caaagttccg tctaccttgg acccaaattt cgatcattgc agaactgccc aacttggtga 3180 
3181 ttcttaagtt attgctcaga gcctttgaag gggatcactg ggaagtgaaa gattcagagt 3240 
3241 tcctagaact caaatactta aaactggaca acctcaaagt tgtacaatgg tccatctctg 3300 
3301 atgatgcttt tcctaagctt gaacatttgg ttttaacgaa atgtaagcat cttgagaaaa 3360 
3361 tcccttctcg ttttgaagat gctgtttgtc taaatagagt tgaggtgaac tggtgcaact 3420 
3421 ggaatgttgc caattcagcc caagatattc aaactatgca acatgaagtt atagcaaatg 3480 
3481 attcattcac agttactata cagcctccag attggtctaa agaacagccc cttgactctt 3540 
3541 agcaaaggtt tgttcttgct gtgttcatcc aagtgcattt aacatttatt cattttgttt 3600 
3601 tacaccagaa catgtttatt ttgctagtat tacttgatac attaaaagaa atcgaactca 3660 
3661 tatttctgct acagtcttaa cttttcttgg gcttacttga ggtctagatt agatcaatgg 3720 
3721 ttcatgtaat ttttaattca ctgtttcatt caactgtctt atgatagttg tgaaatgaca 3780 
3781 atattgttat ccctagccaa atttattatg ttcaaatgaa aactgatgtc acaactactt 3840 
3841 ttttgtgaaa tgtttttgaa ttttttgcta taaaattgac gaattgacag cttctatatt 3900 
3901 tgtcagctaa actctttgtc accagaagtg tatttagaat tactgtggtt ttatgaaaga 3960 
3961 gttctgtaga attttatgct tttgcagaat atagtttaaa acaacaacac ttctctgttt 4020 
4021 cagagatagc agaagctaaa gttcaaggca ttttgtttat ttctagaaca agtggagttc 4080 
4081 ttatgttgaa ttcttgaaaa gaagaagaat caggagcagg taaagttatc tctttttatg 4140 
4141 tttttcttct tttagatgtt atttcttcat cttgaacgtg aacaccgctg aaagcatttt 4200 
4201 aataaaaccg gagagaaaaa taagatcttt ttatataaag cattatcatg taaatatgcc 4260 
4261 taaatccata tggtacaact gtttgacaaa atgatagaga ggggagtttt atagtataag 4320 
4321 taaaacagga ttgagaaaaa aatccttgca cgattttcaa tttctggcca catcacaatg 4380 
4381 tgtgtcaaag ttcccctctt taagtggaac aagcaatcag aaaagctcat tcttatcggt 4440 
4441 gacataccaa taccagctga ctgtctcatc ttggttaact tagccttgct tacttagact 4500 
4501 attagattag ttactaatga actggtaaat tggaaccaaa tgtagttagc ttgatgagct 4560 
4561 ggtagacatg tatatatgaa gatacacgcg taactttagt cgatggttaa tttttcattt 4620 
4621 ttgatttttt ttcttcacag agtatatatg aacttggcct aaaagttttg cttcactaat 4680 
4681 ttaactatta ccgtggatga aacaagcatg gcaacatttt caacaactat cactcaagca 4740 
4741 atgtaaaaaa tggaggttct acgagcggta catgtaagag ttttgtgcac acaagaggtt 4800 
4801 ctgagacttg aaccatccat gtccaaggca gttgagatgc tagtaaagaa agaagaagat 4860 
- 4861 gagcctgcac taattaatct ccctgtatga , atgagagaat gagaaaaaga tggagcttca 4920 
4921 tgaaccaaaa gttacctttt ttttttcttc ttaatggcat tactttgaag cacatgtttg 4980 
4981 ttagttgtaa attgtaatgg tgaagtgttt gtaaatatag ggagtgatat ttgaaagaat 5040 
5041 ggttgtgtta tctttacaaa ccggaatcat ttctgtataa ttttcttctg taatttttgg 5100 
5101 tttcggttta ttcattactc atttcagtaa gctt 5134 
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